Jacobian adaptation based on the frequency-filtered spectral energies
Abstract
Jacobian Adaptation (JA) of the acoustic models is an efficient adaptation technique for robust speech recognition. Several improvements to JA have been proposed in recent years, either to generalize the Jacobian linear transformation to cases of large noise mismatch between training and testing, or to extend the adaptation to other degrading factors, such as channel distortion and vocal tract length. However, the JA technique has so far been used only with conventional mel-frequency cepstral coefficients (MFCC). In this paper, the JA technique is applied to an alternative type of features, the Frequency-Filtered (FF) spectral energies, resulting in a more computationally efficient approach. Furthermore, in experimental tests with the Aurora1 database, this new approach showed improved recognition performance with respect to Jacobian adaptation with MFCCs.
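The abstract does not define the FF features. The related abstracts on this page describe them as the result of filtering the frequency sequence of log mel-scaled filter-bank energies with a first- or second-order FIR filter. The following is a minimal sketch of that filtering step only, not the paper's implementation. The function name `frequency_filter`, the default taps (corresponding to H(z) = z - z^{-1}), and the zero padding at the band edges are assumptions made for illustration.

```python
import numpy as np


def frequency_filter(log_energies, taps=(1.0, 0.0, -1.0)):
    """Filter log filter-bank energies along the frequency (band) axis.

    log_energies: array of shape (n_frames, n_bands), n_bands >= len(taps).
    taps: FIR coefficients. The default (assumed, not taken from the
          abstract) gives y[k] = x[k+1] - x[k-1], i.e. H(z) = z - z^{-1}.
    """
    x = np.atleast_2d(np.asarray(log_energies, dtype=float))
    # mode="same" implicitly zero-pads the band sequence at both ends
    # and returns one output value per band.
    return np.stack([np.convolve(frame, taps, mode="same") for frame in x])


# Usage: one frame with five (hypothetical) log filter-bank energies.
frame = np.array([[1.0, 2.0, 4.0, 7.0, 11.0]])
ff = frequency_filter(frame)
```

Unlike MFCC computation, this step needs no discrete cosine transform, only a difference between neighbouring bands.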
Similar resources
Individual on-line variance adaptation of frequency filtered parameters for robust ASR
In this paper we address the problem of robust speech recognition. We propose a new method based on the individual variance adaptation of frequency filtered parameters to reduce the deleterious effects of additive narrow-band noise. The method can be interpreted as a spectral weighting that assigns increased importance to the most reliable spectral components, typically the spectral peaks. The ...
Improved Jacobian adaptation for fast acoustic model adaptation in noisy speech recognition
This paper describes two algorithms to improve a previously proposed Jacobian adaptation (JA) technique for fast acoustic speech recognizer model adaptation in environmental noise. The first technique introduces a new bias term that is a function of the reference noise estimate, to account for the mismatch between the reference noise estimate and the noise component of the noisy speech spectrum. This...
Speaker Recognition Using Frequency Filtered Spectral Energies
The spectral parameters that result from filtering the frequency sequence of log mel-scaled filter-bank energies with a simple first or second order FIR filter have proved to be an efficient speech representation in terms of both speech recognition rate and computational load. Recently, the authors have shown that this frequency filtering can approximately equalize the cepstrum variance enhanci...
Speaker verification on the polycost database using frequency filtered spectral energies
The spectral parameters that result from filtering the frequency sequence of log mel-scaled filter-bank energies with a first or second order FIR filter have proved to be competitive for speech recognition. Recently, the authors have shown that this frequency filtering can approximately equalize the cepstrum variance enhancing the oscillations of the spectral envelope curve that are most effect...
Survey of Graph Energies
Graph energy is a graph-spectrum-based quantity, introduced in the 1970s. After a latent period of 20-30 years, it became a popular topic of research both in mathematical chemistry and in "pure" spectral graph theory, resulting in over 600 published papers. Eventually, scores of different graph energies have been conceived. In this article we...